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In this article the Gordan theorem is apphed to the thermodynamics of a chemical reaction 
network at steady state. From a theoretical viewpoint it is equivalent to the Clausius formulation 
of the second law for the out of equilibrium steady states of chemical networks, i.e. it states that 
the exclusion (presence) of closed reactions loops makes possible (impossible) the definition of a 
thermodynamic potential and vice versa. On the computational side, it reveals that calculating 
reactions free energy and searching infeasible loops in flux states are dual problems whose solutions 
are alternatively inconsistent. The relevance of this result for applications is discussed with an 
example in the field of constraints-based modeling of cellular metabolism where it leads to efficient 
and scalable methods to afford the energy balance analysis. 
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Introduction 

The non-equilibrium thermodynamics of chemical re- 
action networks has been a subject of great interest 
and research efforts in recent years, from the pioneer- 
ing works on network thermodynamics [1 to the recent 
study of their statistical grounds through fluctuation 
theorems in [3]. The quest for general results or under- 
lying variational principles is open, as it is for out of 
equilibrium thermodynamics in general [5 . Regarding its 
applications, a chemical reaction network at steady state 
under the hypothesis of local equilibrium and well- mixing 
is the simplest (and standard) model of cell metabolism. 
This model has been applied to large (genome-) scale sys- 
tems only in recent times, where it was successful in de- 
scribing and predicting, even at a quantitative level, cel- 
lular metabolic processes[B]. In general, the stochiometry 
of the network defines linear mass-balanced equations for 
the reaction fluxes and thus a feasible solution space for 
the steady states. In particular, putting forward a func- 
tional hypothesis, one solution can be selected by maxi- 
mizing a linear function that describes e.g. the biomass 
or ATP production. In this way is possible to use power- 
ful methods and tools from linear programming, a frame- 
work that goes under the name of flux balance analysis 
(FBA)|7j. More recently, the question of the implemen- 
tation of the thermodynamic constraints has been posed, 
a framework has been called energy balance analysis 
(EB A) 18^ . This problem is connected on one hand to the 
calculation of reactions free energy, metabolites chemical 
potential and concentration [9l [TO] . on the other to the re- 
moval of infeasible loops from flux configurations [11]. All 
through the literature there is the intuition of the infeasi- 
bility of closed reaction loops, since it is easy to show that 
their existence renders impossible to define a thermody- 
namic potential. So, their exclusion is necessary, but is 
it sufficient to guarantee the thermodynamical consis- 
tency? In this article a positive aswer to this question 
will be given, showing that it comes from the application 



of the Gordan theorem, the oldest theorem of the alter- 
natives and a key result in optimization|12) . The applica- 
tion of the Gordan theorem in this context states shortly 
that, under the hypothesis of local equilibrium for well- 
mixed systems at constant pressure and temperature, a 
consistent thermodynamic potential can be defined if and 
only if there are no closed loops. This is substantially 
equivalent to the Clausius formulation of the second law 
in networks where closed reaction loops play the role of 
perpetuum mobile of the second kind and its demonstra- 
tion leads to address more efficiently the energy balance 
analysis of metabolic networks. Infact, more in details, 
the theorem reveals that finding infeasible reactions loops 
and/or calculating reactions free energies are dual prob- 
lems whose solutions are alternatively inconsistent. For 
instance, current methods apt to find and eliminate in- 
feasible loops suffers of scalability issues [13]. Then, this 
difficulty can be circumvented, by dealing with the eas- 
ier dual problem of calculating reactions free energy, that 
consists in solving a system of linear inequalities. In the 
following we will give an elementary demonstration of 
this theorem and discuss its applications with an exam- 
ple in the field of constraints-based modeling of cellular 
metabolism where it gives efficient methods to afford the 
energy balance analysis. 



RESULTS 

Suppose the stochiometry of a chemical reaction net- 
work is given in terms of a matrix S = (5'f), where 
stands for the stochiometric coefficient of the chemical 
species ^ ~ 1 . . . M in the reaction i = 1 . . . N. A flux 
vector V = {vi) is by definition|14j thermodynamically 
feasible provided the existence of a free energy vector 
AG = (AGi) such that 
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Definition 1 (Thermodynamic feasibility). 

i;,AG, < Vi 
r-AG==0 VreA"er(§). (1) 

Writing the free energy change in terms of the chemical 
potentials g = (5^) 

AG,; =^5,%, (2) 

and defining for sake of semplicity in notations = 
— sign(vi)S^, we have a system of linear inequalities 
equivalent to ([!]) 

EC"5.>0. (3) 

Now, the existence of solutions of the system (|3| is ruled 
by the Gordan theorem: the system ([3| has solution if 
and only if the dual system 

Definition 2 (Infeasible loops). 

E^fc^^O V/i 

i 

h>0, k ^ (4) 

has no solution, where the solutions of Q define closed 
reaction loops. In other terms, 

Theorem (Gordan theorem). One and only one of the 
systems and ^ have solution. 

Proof. Q has solution ([3| has no solution. 

Suppose to have a solution k* of Q. Take any vector 
g, multiply it component by component with the equa- 
tions of the system Q and sum over them. Exchanging 
the indeces /i and i in the sums and given the positivity 
of k* , it is straightforward to conclude that no vector g 
can satisfy the system ([s]). 

Proof. ([3]) has no solution ^ Q has solution. 

The demonstration is given by induction in the number 
M of unknows. 

The statment is true for M = 1. Infact, for one un- 
konwn, the system ([s]) is inconsistent if and only if there 
is at least one couple i, j such that £,j£,j ~ < 0, 

and in this case ki = 1, kj = c and ki = V/ 7^ i,j is a 
solution of Q. 

Let's consider a system of the type ([s]) with M un- 
knows and suppose it is inconsistent. We will prove 
that the dual system has solutions supposing the the- 
orem true for systems with M — 1 unknows. We have 
V« ^j'^f/x > -£.f9M, if ^ 0, we can define 



If = -^f /ef , and we have: 

Af-l 

E I'is^ = p^> SM vz : ef < 

M-1 

E 4^5^ = Q3 < 9M Vj- : ef > 

M-1 

Eer3M = ^;>0 VZ: e*^ = 0. (5) 

Writing the system in this form, we can pass to the fol- 
lowing system in M — 1 unknows: 

P^>QJ Vz,j: <0 >0 

i?i > V? : = 0. (6) 

Now, also this system is inconsistent. 

Suppose infact there is a solution g* = (5*), ^ = 

1...A/-I. 

We could add to it any gl^ such that maXjQj{g*) < 
g\i < miniPi{g*) and we will have a solution for the 
original system as well, against the hypothesis. 

By induction hypothesis, the theorem is true for sys- 
tems with M — 1 unknows. Then, referring to ([6]), there 
are fc^ > 0, A:/ > with at least one positive, such that 

^ijiii - ij ) + J2i ^i^i = V^. From this we have 
finally a solution for the system Q: 

3 

%-E^'^-/ef Vz: ef>0 

i 

h yi: C''=0, (7) 

and the theorem is proven. For sake of simplicity the 
case in which some of the fluxes are null is neglected, but 
it follows along the same lines as an application of the 
Motzkin theorem[12). 

Apart from its inherent theoretical interest, this re- 
sult can be used to perform efficiently the energy bal- 
ance analysis of metabolic networks, because it suggests 
to deal with the dual system ^ if we want to find and 
correct flux configurations from infeasible loops (calcu- 
lating and/or correcting a consistent chemical potential 
vector as a by-product). 

An algorithm was provided in|15j complementing stan- 
dard relaxation method [H] (or MinOver scheme [IB]) 
with an exaustive search, in the spirit of recent construc- 
tive demonstrations of the Farkas Lemma JJ, the most 
famous extension of the Gordan theorem. The algorithm 
works by correcting step by step the least unsatisfied con- 
straint, a being constant (Minover) or proportional to the 
amount by which the constraint is violated (Relaxation) , 
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Cycle ID 


Lenght 


Formula 


1 


3 


ACCOAL + PTA2 - PPAKr(R) 


2 


3 


GLUABUTt7(R) - GLUt2r(R) + ABUTt2 


3 


3 


ACCOAL - SUCOAS(R) + PPCSCT 


4 


3 


YALTA - ALATA_L(R) + YALTA 


5 


3 


ADK3(R) - ADKl(R) + NDPKl(R) 


6 


3 


ADKl(R) - ADK3(R) - NDPKl(R) 


7 


3 


NAt3_l(R) - THRt2(R) + THRt4 


8 


3 


NAt3_l(R) - PROt2r(R) - PROt4 


9 


3 


NAt3_l(R) - GLUt2r(R) + GLUt4 


10 


3 


NAt3_l(R) - SERt2r(R) + SERt4 



TABLE L The 10 unfeasible cycles identified from thermody- 
namic feasibility analysis of 10® different randomly generated 
flux configurations for the Escherichia coli metabolic reaction 
network iJR904. Plus (resp. minus) signs indicate that the 
reaction participates in the cycle in its forward (resp. back- 
ward) direction. (R) indicates that the corresponding reac- 
tion is putatively reversible according to [19]. The directions 
of the putatively reversible fluxes PPAKr and SOCOAS turn 
out to be in fact constrained. 




FIG. 1: An infeasible loop that can be present in the 
metabolic network model iJ904 if the putative reversible reac- 
tion PPAKr is considered to work in the backward direction. 



Conclusions 



i.e. a series is defined: 

= mmj^^f5^(Z) (8) 

5/.a + l)=3M(0+<'J, (9) 

till a solution is found, if any, in polynomial time|15j. 
If the algorithm doesn't converge, by the theorem there 
should be infeasible loops. Then, looking at the series 
of distinct least unsatisfied constraints, that unless of 
very bad cases is not large, it is possible to find them. 
The Minover scheme alone was already applied to the 
dual part of the stochiometric matrix of metabolic net- 
works in order to calculate the production profile jT5]. In 
[T5] this algorithm was applied to calculate the list of in- 
feasible loops in the recent model of E.coli metabolism 
iAF1260, here, as a further example, it is reported in Ta- 
ble H] a list of the possible unfeasible loops founded with 
this method in the model of E.Coli metabolic network 
iJR904[T5]. This network is composed of 931 reactions 
(245 of which are reversible) among 761 metabolites. One 
of this loop is depicted in fig(l] from which it is clear that 
e.g. the putative reversible reaction PPAKr sohuld oper- 
ate in the backward direction in order to guarantee ther- 
modynamic feasibility. The thermodynamic feasibility of 
flux configurations in which the presence of these loops is 
removed from the beginning was tested over 10^ random 
instances. The small number of such loops should be 
ascribed to the detailed prior thermodynamic informa- 
tion on reaction reversibility usually provided with the 
network models, whose assessement comes from a careful 
estimate of the chemical potentials |2D|. 



In this article was given a rigourous proof that re- 
action fluxes in a chemical reaction network at steady 
state are thermodynamically feasible if and only if there 
are no closed reaction loops. It was showed that this 
is not a trivial statment but an application of the Gor- 
dan theorem. Infact, it states in detail that calculating 
free energies and/or chemical potentials and assessing the 
thermodynamic feasibility of flux configurations through 
the search for infeasible loops are dual problems whose 
solutions are mutually inconsistent. This in turn pro- 
vides efficient ways to perform the energy balance anal- 
ysis of metabolic networks and a genome-scale example 
was given. From a theoretical viewpoint, the result could 
be extended to a more general set of processes to define 
the underlying thermodynamic variational principle it- 
self: since there are no loops, a potential can be defined 
whose changes have a definite sign. From this point of 
view the theorem extends the Clausius formulation of the 
second law in a network context, that could be an inter- 
esting result for the emerging field of network science pi] 
as well. Further, it would be interesting to investigate the 
statistical basis of this theorem. Regarding applications, 
apart from thermodynamic feasibility, another important 
problem related to flux analysis in constraint based mod- 
eling concerns the link between the network structure 
and its productive capabilities [22]. In a recently devel- 
oped framework inspired by economic systems modeling 
, the fluxes are calculated from the minimal constraint 
that for each chemical species at stationarity the overall 
consumption cannot exceed the total supply (Von Neu- 
mann constraints) , and the aim is to calculate producibil- 
ity profiles and to infer the biomass coefficients directly 
from the stochiometry[18j. The application of the the 
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Farkas lemma in this context, i.e. to the dual part of the 
stochiometric matrix, has been partially shown in |24j 
in order to study the growth media for E.Coli, reveal- 
ing the connection between producibility and conserved 
metabolic pools. However, similarly to this work, these 
theorems of the alternative could provide also interest- 
ing insights for the calculation itself of the producibility 
profile toghether with the conserved metabolic pools, an 
aspect that is leaved for future investigations. 
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